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Abstract 

In this paper we show that an ac- 
count for coordination can be con- 
structed using the derivation struc- 
tures in a lexicalized Tree Adjoin- 
ing Grammar (LTAG). We present a 
notion of derivation in LTAGs that 
preserves the notion of hxed con- 



stituency in the LTAG lexicon while 



providing the flexibility needed for 
coordination phenomena. We also 
discuss the construction of a practi- 
cal parser for LTAGs that can han- 
dle coordination including cases of 
non-constituent coordination. 



1 Introduction 

Lexicalized Tree Adjoining Grammars 
(LTAG) and Combinatory Categorial Gram- 
mar (CCG) ( Steedman, 199^ ) are known to 
be weakly equivalent but not strongly equiv- 
alent. Coordination schema have a natural 
description in CCG, while these schema have 
no natural equivalent in a standard LTAG. 



In (Joshi and Schabes, 1991) it was shown 
that in principle it is possible to construct 
a CCG-like account for coordination in the 
framework of LTAGs, but there was no clear 
notion of what the derivation structure would 
look like. In this paper, continuing the work 
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of flJoshi and Schabes, 1991 ), we show that an 
account for coordination can be constructed 
using the derivation structures in an LTAG. 

Using the notions given in this paper we 
also discuss the construction of practical 
parser for LTAGs that can handle coordina- 
tion including cases of non-constituent coordi- 
nation. This approach has been implemented 



in the XTAG system (XTAG Research Group, 



1995|) thus extending it to handle coordina- 



tion. This is the first full implementation of 
coordination in the LTAG framework. 

2 LTAG 

An LTAG is a set of trees (elementary trees) 
which have at least one terminal symbol on 
its frontier called the anchor. Each node in 
the tree has a unique address obtained by ap- 
plying a Gorn tree addressing scheme, shown 
in the tree a (cooked) (Fig. |l]). Trees can 
be rewritten using substitution and adjunc- 
tion. A history of these operations on ele- 
mentary trees in the form of a derivation tree 
can be used to reconstruct the derivation of 
a string recog nized by a LTAG. In Fig. [I], 
the tree (3 (dried) adjoins into a (beans) and 
trees a (John) and a (beans) substitutes into 
a (cooked) to give a derivation tree for John 
cooked dried beans. Each node in the deriva- 
tion tree is the name of an elementary tree. 
The labels on the edges denote the address in 
the parent node where a substitution or ad- 
junction has occured. 

3 Trees as Structured Categories 



In flJoshi and Schabes, 1991| ) elementary trees 
as well as derived trees in an LTAG were con- 



TAGG: a (cooked) a (John) a(beans) p (dried) 




P(dried) dr ! cd b< , ans 



Derivation Tree Derived Tree 

Figure 1: Example of an LTAG and an LTAG 
derivation 



preserving the linear precedence of the 
arguments. 

2. coordinating the VP nodes, which are the 
least nodes dominating the two contigu- 
ous strings. 

3. collapsing the supertrees above the VP 
node. 

4. selecting the leftmost NP as the lexi- 
cal site for the argument, since prece- 
dence with the verb is maintained by this 
choice. 



sidered as structured categories defined 
3-tuple of an elementary or derived tree, the 
string it spanned and the functional type of 
the tree, e.g {<J\,l\,T\) in Fig. [2|. Functional 
types for trees could be thought of as defin- 
ing un-Curried functions corresponding to the 
Curried CCG counterpart. A functional type 
was given to sequences of lexical items in 
trees even when they were not contiguous; i.e. 
discontinuous constituents were also assigned 
types. They were, however, barred from co- 
ordinating. 



1 j eats cookies 
t. NP — s- S 



Figure 2: Structured Category for eats cookies 

Coordination of two structured categories 
cri,cJ2 succeeded if the lexical strings of both 
categories were contiguous, the functional 
types were identical, and the least nodes dom- 
inating the strings spanned by the component 
tree have the same label. For example, in 
Fig. H the tree corresponding to eats cookies 
and drinks beer would be obtained by: 

1. equating the NP nodesQ in o\ and 02, 



1 This notion of sharing should not be confused 
with a deletion type analysis of coordinatio n. The 
scheme presented in (Joshi and Schabes, 1991) as well 




11: eats cookies 
Tl:NP^» S 



12: drinks beer 
T2: NP - S 



eats cookies drinks 
1: eats cookies and drinks beer 
I: NP —a- S 



Figure 3: Coordination of eats cookies and 
drinks beer 

The process of coordination built a new de- 
rived structure given previously built pieces 
of derived structure (or perhaps elementary 
structures). There is no clear notion of a 
derivation structure for this process. 

4 Coordination in TAG 

An account for coordination in a standard 
LTAG cannot be given without introducing 
a notion of sharing of arguments in the two 
lexically anchored trees because of the notion 



of locality of arguments in LTAG. In for 
instance, the NP the beans in the "right node 
raising" construction has to be shared by the 
two elementary trees (anchored by cooked and 
ate respectively). 

(1) (((Harry cooked) and (Mary ate)) 
the beans) 

We introduce a notation that will enable 
us to talk about this more formally. In Fig. [I] 
the notation J, denotes that a node is a non- 
terminal and hence expects a substitution op- 
eration to occur. The notation * marks the 



as the analysis presented in this paper are not deletion 
analyses. 



foot node of an auxiliary tree. Making this 
explicit we can view an elementary tree as a 
ordered pair of the tree structure and a or- 
dered set^ of such nodes from its frontier^], 
e.g. the tree for cooked will be represented 
as (a(cooked), {1, 2.2}). Note that this rep- 
resentation is not required by the LTAG for- 
malism. The second projection of this ordered 
pair is used here for ease of explication. Let 
the second projection of the pair minus the 
foot nodes be the substitution set. We will 
occasionally use the first projection of the el- 
ementary tree to refer to the ordered pair. 

Setting up Contractions. We introduce an 
operation called build- contraction that takes 
an elementary tree, places a subset from its 
second projection into a contraction set and 
assigns the difference of the set in the second 
projection of the original elementary tree and 
the contraction set to the second projection of 
the new elementary tree. The contents of the 
contraction set of a tree can be inferred from 
the contents of the set in the second projec- 
tion of the elementary tree. Hence, while we 
refer to the contraction set of an elementary 
tree, it does not have to be stored along with 
its representation. 

Fig. H] gives some examples; each node 
in the contraction set is circled in the fig- 
ure. In the tree (a(cooked), {1, 2.2}) appli- 
cation of the operation on the NP node at 
address 2.2 gives us a tree with the con- 
traction set {2.2}. The new tree is denoted 
by (a(cooked) {2.2} > {1})) or a(cooked) { 2 .2} for 
short. Placing the NP nodes at addresses 1 
and 2.2 of the tree a(cooked) into the con- 
traction set gives us a(cooked) {^2.2} • 



a(cooked) 




c/t'cooked) S 
{1, 2.2 




Figure 4: Building contraction sets 



2 The ordering is given by the fact that the ele- 
ments of the set are Gorn addresses. 

3 We shall assume there are no adjunction con- 
straints in this paper. 



We assume that the anchor cannot be in- 
volved in a build- contraction. This assump- 
tion needs to be revised when gapping is con- 
sidered in this framework (§||). 

The Coordination Schema. We use the 
standard notion of coordination shown in 
Fig. |5] which maps two constituents of like 
type, but with different interpretations, into 
a constituent of the same typeQ. 



Con] 



Figure 5: Coordination schema 

We add a new rewriting operation to the 
LTAG formalism called conjoin^ While sub- 
stitution and adjunction take two trees to give 
a derived tree, conjoin takes three trees and 
composes them to give a derived tree. One of 
the trees is always the tree obtained by spe- 
cializing the schema in Fig. [| for a particular 
category^. 

Informally, the conjoin operation works as 
follows: The two trees being coordinated are 
substituted into the conjunction tree. This 
notion of substitution differs from the tra- 
ditional LTAG substitution operation in the 
following way: In LTAG substitution, always 
the root node of the tree being substituted 
is identified with the substitution site. In 
the conjoin operation however, the node sub- 
stituting into the conjunction tree is given 
by an algorithm, which we shall call Find- 
Root that takes into account the contrac- 
tion sets of the two trees. FindRoot returns 
the lowest node that dominates all nodes in 
the substitution set of the elementary treeQ, 



4 In this paper, we do not consider coordination of 
unlike categories, e.g. Pat is a Republican and 



of it. flSarkar and .Toshi. 199r| discusses such cases, 



Jorgcnscii and Abeillc (1992|) 



following 

Later we will discuss an alternative which re- 
places this operation by the traditional operations of 
substitution and adjunction. 

6 The tree obtained will be a lexicalized tree, with 
the lexical anchor as the conjunction: and, but, etc. 

7 This ensures the node picked by FindRoot always 
dominates a contiguous string in a derivation. This 
cap tures the string contigui ty condition that was used 
in (Joshi and Schabes, 1991). A coordinated node will 
never dominate multiple foot nodes. Such a case oc- 



e.g. FindRoot(a(cooked) {2.2}) will return the 
root node, i.e. corresponding to the S conj 
S instantiation of the coordination schema. 
FindRoot(a(cooked) {1,2.2}) will return node 
address 2.1, corresponding to the V conj V 
instantiation. 

The conjoin operation then creates a con- 
traction between nodes in the contraction sets 
of the trees being coordinated. The term con- 
traction is taken from the graph-theoretic no- 
tion of edge contraction. In a graph, when 
an edge joining two vertices is contracted, the 
nodes are merged and the new vertex retains 
edges to the union of the neighbors of the 
merged vertices^. The conjoin operation sup- 
plies a new edge between each corresponding 
node in the contraction set and then contracts 
that edge. As a constraint on the application 
of the conjoin operation, the contraction sets 
of the two trees must be identical. 

Another way of viewing the conjoin op- 
eration is as the construction of an aux- 
iliary structure from an elementary tree. 
For example, from the elementary tree 
(a (drinks), {1,2.2}) , the conjoin opera- 
tion would create the auxiliary structure 
(P (drinks) hx, {2.2}) shown in Fig. ||. The 
adjunction operation would now be responsi- 
ble for creating contractions between nodes in 
the contraction sets of the two trees supplied 
to it. Such an approach is attractive for two 
reasons. First, it uses only the traditional op- 
erations of substitution and adjunction. Sec- 
ondly, it treats conj X as a kind of "modifier" 
on the left conjunct X. We do not choose be- 
tween the two representations but continue 
to view the conjoin operation as a part of our 

folJXlclllSIIl 

Tor example, applying conjoin to the trees 
Conj(and), a(eats){i\ and ot(dr inks) m gives 
us the derivation tree and derived structure 
for the constituent in [2] shown in Fig. ^. 



curs, e.g., two auxiliary trees with substitution nodes 
at the same tree address are coordinated with only the 
substitution nodes in the contraction set. 

8 Merging in the graph-theoretic definition of con- 
traction involves the identification of two previously 
distinct nodes. In the process of contraction over 
nodes in elementary trees it is the operation on 
that node (either substitution or adjunction) that is 
identified. 




Figure 6: Coordination as adjunction. 



(2) ... eats cookies and drinks beer. 



Conj(and) yp 



VP and VP 
Conj(and) 





a (eats) cc (drinks) ^ 

2.2I J 2.2 

a (cookies) a (beer) 

Derivation tree 



eats cookies drinks beer 

Derived structure 



Figure 7: An example of the conjoin opera- 
tion. 

In Fig. the nodes a(eats)u\ and 
a(drinks){i\ signify an operation left incom- 
plete at address 1. 

The Effects of Contraction. One of the 
effects of contraction is that the notion of 
a derivation tree for the LTAG formalism 
has to be extended to an acyclic derivation 
grapf^. Simultaneous substitution or adjunc- 
tion modifies a derivation tree into a graph as 
can be seen in Fig. ||. 

If a contracted node in a tree (after the con- 
join operation) is a substitution node, then 
the argument is recorded as a substitution 
into the two elementary trees as for example 



in the sentences |(3]j and |(4)| . 

Chapman eats cookies and drinks 
beer. 

Keats steals and Chapman eats ap- 
ples. 



(3) 
(4) 



Fig. ^ contains the derivation and derived 
structures for (3} and Fig. |9| for (4). In Fig. |9| 
the derivation graph for sentence [4] accounts 
for the coordinations of the traditional non- 
constituent "Keats steals" by carrying out the 

9 We shall use the general notation derivation 
structure to refer to both derivation trees and deriva- 
tion graphs. 



coordination at the root, i.e. S conj S. No likes and Chapman thinks Mary likes beans, 
constituent corresponding to "Keats steals" Chapman thinks simply adjoins into the right 
is created in the process of coordination. conjunct of the coordinate structure^]. 



Conj (and) 
QE(eats) (1] a (drinks), 1} NP 
a(cookies) a(Chapman) a(beer) Chapman y 

Derivation structure 




Figure 8: Derivation for Chapman eats cook- 
ies and drinks beer. 



s 

NP 

ct(Keats) a (apples) a(Chapman) Keats V Chapman y NP 

steals eats apples 

Derivation structure Derived structure 



Conj(and) 

ct(stealsj o(eats) |2 



Elementary trees 
S 




happy 



Derived structure 




Derivation structure 

Conj(and) 
ct(laughed) cc(happy) 



a(who) p(be) 

o 

|5(to) 

o 

pXseemed) 



Figure 10: Derivation for Who laughed and 
seemed to be happy? 



Figure 9: Derivation for Keats steals and 
Chapman eats apples. 

The derived structures in Figs. |8| and |9| are 
difficult to reconcile with traditional notions 
of phrase structure^. However, the deriva- 
tion structure gives us all the information 
about dependency that we need about the 
constituents. The derivation encodes exactly 
how particular elementary trees are put to- 
gether. Obtaining a tree structure from a de- 
rived structure built by the conjoin operation 



is discussed in ( Sarkar and Joshi, 1996 ). 

Considerations of the locality of move- 
ment phenomena and its representation in 
the LTAG formalism ( Kroch and Joshi, 1986| ) 
can also now explain constraints on coordi- 
nate structure, such as across-the-board ex- 
ceptions to the well known coordinate struc- 
ture constraint, see Fig. [H]. Also in cases of 
unbounded right node raising such as Keats 



McCawley (1982 ) raised the heterodox view that 



a discontinuous constituent structure should be given 
for right node raising cases, having the same notion of 
constituency as our approach. However, no conditions 
on the construction of such a structure was given. In 
fact, his mechanism also covered cases of parenthetical 
placement, scrambling, relative clause extraposition 
and heavy NP shift. 



5 Contractions on Anchors 

An LTAG along with the operations of sub- 
stitution and adjunction also has the implicit 
operation of lexical insertion (represented as 
the diamond mark in Fig. |TT|) . Under this 
view, the LTAG trees are taken to be tem- 
plates. For example, the tree in Fig. [ll] is 
now represented as {a(eat), {1, 2.1, 2.2}}. 



a(cats) 




Figure 11: Lexicalization in a LTAG. 

If we extend the notion of contraction in 
the conjoin operation together with the oper- 
ation of lexical insertion we have the following 
observations: The two trees to be used by the 
conjoin operation are no longer strictly lexi- 
calized as the label associated with the dia- 



11 A comparision of this paper's approach with the 
derivational machinery i n CCG and the devices o f 3-D 



coordination is done in ( Sarkar and Joshi, 199(;| ). 



mond mark is a preterminal. Previous uses 
of conjoin applied to two distinct trees. If the 
lexicalization operation is to apply simultane- 
ously, the same anchor projects two elemen- 
tary trees from the lexicon. The process of 
contraction ensures that the anchor is placed 
into a pair of LTAG tree templates with a sin- 
gle lexical insertion. 

Gapping. Using this extension to conjoin, 
we can handle sentences that have the "gap- 



ping" construction like sentence |(5) . 

(5) John ate bananas and Bill strawber- 



ries. 



The conjoin operation applies to copies of 
the same elementary tree when the lexical an- 
chor is in the contraction set. For example, let 
a(eats) be the tree selected by eats. The co- 
ordination of a(eats){2.i} with a copy of itself 
and the subsequent derivation tree is depicted 
in Fig. pP 



s 

S and^Jj^^ 
NPj VP NP^___VP 



Conj(and) 



a(eats) {21) a(eats) (2 1} 




Figure 12: Handling the gapping construction 
using contractions. 

An extension of the approach here will be 
to permit the conjoin operation to create con- 
tractions on all the nodes in contraction sets 
that it dominates during a derivation, allow- 
ing us to recognize cases of gapping such as: 
John wants Penn to win and Bill, Princeton. 
and John wants to try to see Mary and Bill, 
Susan. 

Coordinating Ditransitive verbs. In sen- 
tence |(6)| if we take the position that the 
string Mary a book is not a constituent (i.e. 
give has a structure as in Fig. [Tj|), then we 
can use the notion of contraction over the an- 



12 In English, following Ross (197C ), the anchor goes 
to the left conjunct. 



chor of a tree to derive the sentence in |(6)| . 
The structure we derive is shown in Fig. [TJ|. 

(6) John gave Mary a book and Susan a 
flower. 



NPJ, VP 




Figure 13: Tree for a ditransitive verb in 
LTAG. 




gave Mary a book Susan a flower 

Figure 14: Derived tree for John gave Mary a 
book and Susan a flower. 

Interactions. Permitting contractions on 
multiple substitution and adjunction sites 
along with contractions on the anchor al- 
low the derivation of sluicing structures such 
as £7) (where the conjunct Bill too can be in- 



terpreted as [John loves] Bill too or as Bill 
[loves Mary] fooP|. 



(7) 



John loves Mary and Bill too. 



6 Parsing Issues 

This section discusses parsing issues that arise 
in the modified TAG formalism that we have 
presented. We do not discuss general issues in 
parsing TAGs, rather we give the appropriate 
modifications that are needed to the existing 
Earley-type parsing algorithm for TAGs due 



to |Schabes and Joshi (1988|) . 

The algorithm relies on a tree traversal that 
scans the input string from left to right while 
recognizing the application of the conjoin op- 
eration. The nodes in the elementary trees 
are visited in a top-down left to right manner 



13 Whether this should be d erived syntactica lly is 
controversial, for example, see ( |3tccdman, 1990| ). 



(Fig. 15). Each dot in Fig. 15 divides the tree 
into a left context and a right context, en- 
abling the algorithm to scan the elementary 
tree while trying to recognize possible appli- 
cations of the conjoin operation. 




Figure 15: Example of a tree traversal 

The derived structure corresponding to a 
coordination is a composite structure built by 
applying the conjoin operation to two elemen- 
tary trees and an instantiation of the coordi- 
nation schema. The algorithm never builds 
derived structures. It builds the derivation by 
visiting the appropriate nodes during its tree 
traversal in the following order (see Fig. 16). 



1 2 • • • 3 4 ■ ■ ■ 5 6 • • • 2' 7' • • • 3' 4' • • • 5' 6' • • • 7 8 

The algorithm must also compute the cor- 
rect span of the string for the nodes that have 



been identified via a contraction. Fig. 16 
gives the possible scenarios for the position 
of nodes that have been linked by a contrac- 
tion. When foot nodes undergo contraction, 
the algorithm has to ensure that both the foot 
nodes share the subtree pushed under them, 
e.g. 9 • • • 10 and 9' • • • 10' in Fig. 0(a). Sim- 
ilarly, when substitution nodes undergo con- 
traction, the algorithm has to ensure that the 
tree recognized due by predicting a substi- 
tution is shared by the nodes, e.g. 11 • • • 12 
and 11' • • • 12' in Figs. |J(b) and ||(c). The 
traversals 9 • • • 10 should span the same length 
of the input as 9' • • • 10', similarly for 11 • • • 12 
and 11' • • • 12'. Various positions for such 
traversals is shown in Fig. 16. A derivation 



is valid if the input string is accepted and 
each node in a contraction spans a valid sub- 
string in the input. The complete and formal 
description of the parsing algorithm is given 
in ( |Sarkar and Joshi, 1996| ). 




(11)^(12)^ ^(11)^(12) (irAi2-y 'an 



Figure 16: Moving the dot while recognizing 
a conjoin operation 

7 Conclusion 

We have shown that an account for coordina- 
tion can be given in a LTAG while maintain- 
ing the notion of a derivation structure which 
is central to the LTAG approach. We showed 
that fixed constituency can be maintained at 
the level of the elementary tree while account- 
ing for cases of non-constituent coordination 
and gapping. We discussed the construction 
of a practical parser for LTAG that can han- 
dle these cases of coordination. 
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